Incorporating Opponent Models into Adversary Search
نویسندگان
چکیده
This work presents a generalized theoretical framework that allows incorporation of opponent models into adversary search. We present the M∗ algorithm, a generalization of minimax that uses an arbitrary opponent model to simulate the opponent’s search. The opponent model is a recursive structure consisting of the opponent’s evaluation function and its model of the player. We demonstrate experimentally the potential benefit of using an opponent model. Pruning in M∗ is impossible in the general case. We prove a sufficient condition for pruning and present the αβ∗ algorithm which returns the M∗ value of a tree while searching only necessary branches.
منابع مشابه
The M � Algorithm � Incorporating Opponent Models into Adversary Search
While human players adjust their playing strategy according to their opponent computer programs which are based on the minimax algorithm use tha same playing strategy against a novice as against an expert This is due to the assumption of minimax that the opponent uses the same strategy as the player This work studies the problem of opponent modelling in game playing We recursively de ne a playe...
متن کاملPruning Algorithms for Multi-model Adversary Search
The Multi-model search framework generalizes minimax to allow exploitation of recursive opponent models. In this work we consider adding pruning to the multi-model search. We prove a sufficient condition that enables pruning and describe two pruning algorithms, αβ∗ and αβ∗ 1p. We prove correctness and optimality of the algorithms and provide an experimental study of their pruning power. We show...
متن کاملUsing Reflective Learning to Master Opponent Strategy in a Competitive Environment
Cognitive models of people interacting in competitive environments can be useful, especially in games and simulations. To be successful in such environments, it is necessary to quickly learn the strategy used by the opponent. In addition, as the opponent adjusts its tactics, it is equally important to quickly unlearn opponent strategies that are no longer used. In this paper, we present human p...
متن کاملIntroducing Hierarchy to Non-Hierarchical Planning Models – A Case Study for Behavioral Adversary Models
Hierarchical planning approaches are often pursued when it comes to a real-world application scenario, because they allow for incorporating additional expert knowledge into the domain. That knowledge can be used both for improving plan explanations and for reducing the explored search space. In case a non-hierarchical planning model is already available, for instance because a bottom-up modelin...
متن کاملOnline Multiagent Learning against Memory Bounded Adversaries
The traditional agenda in Multiagent Learning (MAL) has been to develop learners that guarantee convergence to an equilibrium in self-play or that converge to playing the best response against an opponent using one of a fixed set of known targeted strategies. This paper introduces an algorithm called Learn or Exploit for Adversary Induced Markov Decision Process (LoE-AIM) that targets optimalit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996